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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: OSUMI Chieko 

NOZAKI Jinshi 
KIDA Takao 

(ii) TITLE OF INVENTION: RAFFINOSE SYNTHASE GENE, METHOD FOR 

PRODUCING RAFFINOSE, AND TRANSGENIC PLANT 
(iii) NUMBER OF SEQUENCES: 22 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: OBLON, SPIVAK, MCCLELLAND, MAIER & NEUSTADT, P,C. 

(B) STREET: 1755 S. JEFFERSON DAVIS HIGHWAY, FOURTH FLOOR 

(C) CITY: ARLINGTON 

(D) STATE: VIRGINIA 

(E) COUNTRY: USA 

(F) ZIP: 22202 

(v) COMPUTER READABLE FORM: 
_ (A) MEDIUM TYPE: Floppy disk 

g (B) COMPUTER: IBM PC compatible 

£ (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

^! (D) SOFTWARE: Patentin Release #1.0, Version #1.30 (EPO) 

5,^(vi) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: NORMAN F. OBLON 
2 (B) REGISTRATION NUMBER: 24,618 

gvii) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: { 703 ) -4 13 -3 000 
S (B) TELEFAX: ( 703 ) -413 - 2220 




{ 2 ) ^^INFORMATION FOR SEQ ID NO : 1 : 

D (i) SEQUENCE CHARACTERISTICS: 
U (A) LENGTH: 3 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
Phe Gly Trp Cys Thr Trp Asp Ala Phe Tyr Leu Thr Val His Pro Gin 

15 10 15 

Gly Val lie Glu Gly Val Arg His Leu Val Asp Gly Gly Cys 
20 25 30 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
Pro Val Ser Val Gly Cys Phe Val Gly Phe Asp Ala Ser Glu Pro Asp 

1 5 10 15 

Ser Arg His 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
Tyr Asp Gin Asp Gin Met Val Val Val Gin Val Pro Trp Pro 
15 10 

(2) INFORMATION FOR SEQ ID NO : 4 : 

^ (i) SEQUENCE CHARACTERISTICS: 

y (A) LENGTH: 2517 base pairs 

^ (B) TYPE: nucleic acid 

rl (C) STRANDEDNESS: double 

(D) TOPOLOGY : 1 inear 

rjii) MOLECULE TYPE: cDNA to mRNA 

5vi) ORIGINAL SOURCE: 

(A) ORGANISM : cucumber (Cucumis sativas) 

L= (ix) FEATURE: 

P (A) NAME/KEY: CDS 

^ (B) LOCATION: 56. .2407 

i^xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
AAA^^CAAC CCTTCTTTTA GTTTTTTGGG TTTGTTTCTT CTTTTCTTCT CACAA ATG 58 

y Met 

U 1 



GCT 


CCT 


AGT 


TTT 


AAA 


AAT 


GGT 


GGC 


TCC 


AAC 


GTA 


GTT 


TCA 


TTT 


GAT 


GGC 


106 


Ala. 


Pro 


Ser 


Phe 


Lys 


Asn 


Gly 


Gly 


Ser 


Asn 


Val 


Val 


Ser 


Phe 


Asp 


Gly 










5 










10 










15 








TTA 


AAT 


GAC 


ATG 


TCG 


TCA 


CCG 


TTT 


GCA 


ATC 


GAC 


GGA 


TCG 


GAT 


TTC 


ACT 


154 


Leu 


Asn 


Asp 


Met 


Ser 


Ser 


Pro 


Phe 


Ala 


He 


Asp 


Gly 


Ser 


Asp 


Phe 


Thr 








20 










25 










30 










GTG 


AAC 


GGT 


CAT 


TCG 


TTT 


CTG 


TCC 


GAT 


GTT 


CCT 


GAG 


AAC 


ATT 


GTT 


GCT 


202 


Val 


Asn 


Gly 


His 


Ser 


Phe 


Leu 


Ser 


Asp 


Val 


Pro 


Glu 


Asn 


He 


Val 


Ala 






35 










40 










45 












TCT 


CCT 


TCT 


CCG 


TAC 


ACT 


TCG 


ATA 


GAC 


AAG 


TCC 


CCG 


GTT 


TCG 


GTT 


GGT 


250 


Ser 


Pro 


Ser 


Pro 


Tyr 


Thr 


Ser 


He 


Asp 


Lys 


Ser 


Pro 


Val 


Ser 


Val 


Gly 




50 










55 










60 










65 




TGC 


TTT 


GTT 


GGA 


TTC 


GAC 


GCG 


TCG 


GAA 


CCT 


GAT 


AGC 


CGA 


CAT 


GTT 


GTT 


298 


Cys 


Phe 


Val 


Gly 


Phe 


Asp 


Ala 


Ser 


Glu 


Pro 


Asp 


Ser 


Arg 


His 


Val 


Val 












70 










75 










80 






TCG 


ATT 


GGG 


AAG 


CTG 


AAG 


GAT 


ATT 


CGG 


TTT 


ATG 


AGT 


ATT 


TTC 


AGG 


TTT 


346 


Ser 


He 


Gly 


Lys 


Leu 


Lys 


Asp 


He 


Arg 


Phe 


Met 


Ser 


He 


Phe 


Arg 


Phe 
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85 

AAG GTT TGG TGG 
Lys Val Trp Trp 
100 

GAA TCG GAG ACT 
Glu Ser Glu Thr 
115 

CCG TAT GTT TTC 
Pro Tyr Val Phe 
130 

ATT CAG OCT GGG 
lie Gin Pro Gly 

TCG TCG A7^ GTT 
Ser Ser Lys Val 
165 

GCT GGT GAT GAT 
Ala ^ly Asp Asp 

d 180 
AGGUACC CAT CTT 
Arg'^rhr His Leu 

GGTl^TC GTG GAG 
Glytile Val Asp 

2107 

ACG^TT CAT CCA 
Thr.Wal His Pro 

GGC^GT TGT CCT 
GlyQjly Cys Pro 
O 245 
ATC GGA CAC GAT 
lie Gly His Asp 
260 

GTC GCC GGC GAG 
Val Ala Gly Glu 
275 

TAC AAA TTC CGT 
Tyr Lys Phe Arg 
290 

GGC CAG AAG GGG 
Gly Gin Lys Gly 

AAG ACT GTG GAG 
Lys Thr Val Glu 
325 

GGT GGC CTT CGC 
Gly Gly Leu Arg 
340 



ACT ACA CAC TGG 
Thr Thr His Trp 
105 

CAG ATT GTG ATC 
Gin lie Val lie 
120 

CTT CTT CCG ATC 
Leu Leu Pro lie 
135 

GAT GAT GAC TTT 
Asp Asp Asp Phe 
150 

GTT GAT GCA TCG 
Val Asp Ala Ser 

CCG TTT GCA CTT 
Pro Phe Ala Leu 
185 

GGA ACT TTT CGC 
Gly Thr Phe Arg 
200 

AAA TTC GGT TGG 
Lys Phe Gly Trp 
215 

CAG GGC GTA ATA 
Gin Gly Val lie 
230 

CCC GGT TTA GTC 
Pro Gly Leu Val 

TCG GAT CCC ATC 
Ser Asp Pro lie 
265 

CAA ATG CCC TGC 
Gin Met Pro Cys 
280 

GAC TAC GTC AAT 
Asp Tyr Val Asn 
295 

ATG AAG GCG TTT 
Met Lys Ala Phe 
310 

CAT GTT TAT GTT 
His Val Tyr Val 

CCG CAG GTG CCT 
Pro Gin Val Pro 
345 



90 

GTT GGT CGA AAT 
Val Gly Arg Asn 

CTT GAG AAG TCA 
Leu Glu Lys Ser 
125 

GTT GAG GGA CCG 
Val Glu Gly Pro 
140 

GTC GAT GTT TGT 
Val Asp Val Cys 
155 

TTC CGA AGT ATG 
Phe Arg Ser Met 
170 

GTT AAA GAG GCG 
Val Lys Glu Ala 

TTG TTG GAG GAG 
Leu Leu Glu Glu 
- 205 

TGC ACG TGG GAC 
Cys Thr Trp Asp 
220 

GAA GGC GTG AGG 
Glu Gly Val Arg 
235 

CTA ATC GAC GAT 
Leu lie Asp Asp 
250 

ACC AAA GAA GGA 
Thr Lys Glu Gly 

CGT CTT TTG AAA 
Arg Leu Leu Lys 
285 

CCC AAG GCC ACC 
Pro Lys Ala Thr 
300 

ATA GAT GAA CTC 
lie Asp Glu Leu 
315 

TGG CAT GCT TTG 
Trp His Ala Leu 
330 

GGC TTG CCT GAG 
Gly Leu Pro Glu 



95 

GGT GGG GAT CTT 
Gly Gly Asp Leu 
110 

GAT TCT GGT CGA 
Asp Ser Gly Arg 

TTC CGA ACC TCG 
Phe Arg Thr Ser 
145 

GTC GAG AGT GGT 
Val Glu Ser Gly 
160 

TTG TAT CTT CAT 
Leu Tyr Leu His 
175 

ATG AAG ATC GTG 
Met Lys lie Val 
190 

AAG ACT CCA CCA 
Lys Thr Pro Pro 

GCG TTT TAC CTA 
Ala Phe Tyr Leu 
225 

CAT CTC GTC GAC 
His Leu Val Asp 
240 

GGT TGG CAA TCC 
Gly Trp Gin Ser 
255 

ATG AAC CAA ACC 
Met Asn Gin Thr 
270 

TTC CAA GAG AAT 
Phe Gin Glu Asn 

GGC CCC CGA GCC 
Gly Pro Arg Ala 
305 

AAA GGA GAG TTT 
Lys Gly Glu Phe 
320 

TGT GGA TAT TGG 
Cys Gly Tyr Trp 
335 

GCA CGT GTG ATT 
Ala Arg Val lie 
350 
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CAG CCA GTG CTT TCA CCA GGG CTG CAG ATG ACG ATG GAG GAT TTG GCG 1162 
Gin Pro Val Leu Ser Pro Gly Leu Gin Met Thr Met Glu Asp Leu Ala 

355 360 365 

GTG GAT AAG ATT GTT CTT CAT AAG GTC GGG CTG GTC CCG CCG GAG AAG 1210 
Val Asp Lys lie Val Leu His Lys Val Gly Leu Val Pro Pro Glu Lys 
370 375 380 385 

GCT GAG GAG ATG TAG GAA GGA CTT CAT GCT CAT TTG GAA AAA GTT GGG 12 58 

Ala Glu Glu Met Tyr Glu Gly Leu His Ala His Leu Glu Lys Val Gly 

390 395 400 

ATC GAC GGT GTT AAG ATT GAC GTT ATC CAC CTA TTG GAG ATG TTG TGT 13 06 

lie Asp Gly Val Lys lie Asp Val lie His Leu Leu Glu Met Leu Cys 

405 410 415 

GAA GAC TAT GGA GGG AGA GTG GAT TTG GCA AAG GCA TAT TAC A7^ GCA 13 54 

Glu Asp Tyr Gly Gly Arg Val Asp Leu Ala Lys Ala Tyr Tyr Lys Ala 

420 425 430 

ATG ACC AAA TCA ATA AAT AAA CAT TTT AAA GGA AAT GGA GTC ATT GCA 14 02 

Met Thr Lys Ser lie Asn Lys His Phe Lys Gly Asn Gly Val lie Ala 

,^35 440 445 

AGT k^TG GAA CAT TGT AAC GAC TTC ATG TTC CTT GGC ACG GAA GCT ATC 14 5 0 

SerSSet Glu His Cys Asn Asp Phe Met Phe Leu Gly Thr Glu Ala lie 
450%^ 455 460 465 

TCTy&TT GGT CGT GTT GGT GAT GAC TTT TGG TGC ACG GAC CCC TCT GGT 14 98 

Ser^jeu Gly Arg Val Gly Asp Asp Phe Trp Cys Thr Asp Pro Ser Gly 

y 470 475 480 

GAT^CA AAC GGT ACG TTT TGG CTC CAA GGA TGT CAC ATG GTT CAT TGT 154 6 

Aspl^ro Asn Gly Thr Phe Trp Leu Gin Gly Cys His Met Val His Cys 

485 490 495 

GCC2|^C GAC AGC TTG TGG ATG GGG AAC TTC ATC CAC CCT GAC TGG GAT 1594 
AlaJS^sn Asp Ser Leu Trp Met Gly Asn Phe lie His Pro Asp Trp Asp 

p 500* 505 510 

ATGQrTC CAA TCC ACC CAC CCT TGT GCC GCC TTC CAT GCT GCC TCT CGA 1642 
Met^^Phe Gin Ser Thr His Pro Cys Ala Ala Phe His Ala Ala Ser Arg 

515 520 525 

GCC ATC TCT GGT GGC CCG ATC TAT GTT AGT GAT TCT GTG GGA AAG CAT 1690 
Ala lie Ser Gly Gly Pro lie Tyr Val Ser Asp Ser Val Gly Lys His 
530 535 540 545 

AAC TTT GAT CTT CTG AAA AAA CTA GTG CTT CCT GAT GGA TCG ATC CTT 173 8 

Asn Phe Asp Leu Leu Lys Lys Leu Val Leu Pro Asp Gly Ser lie Leu 

550 555 560 

CGA AGT GAG TAC TAT GCA CTC CCG ACT CGC GAT TGT TTG TTT GAA GAC 17 86 

Arg Ser Glu Tyr Tyr Ala Leu Pro Thr Arg Asp Cys Leu Phe Glu Asp 

565 570 575 

CCT TTG CAT AAT GGA GAA ACT ATG CTT AAG ATT TGG AAT CTC AAC AAG 1834 
Pro Leu His Asn Gly Glu Thr Met Leu Lys lie Trp Asn Leu Asn Lys 

580 585 590 

TTC ACT GGA GTG ATT GGT GCA TTC AAC TGC CAA GGA GGA GGA TGG TGT 18 82 

Phe Thr Gly Val lie Gly Ala Phe Asn Cys Gin Gly Gly Gly Trp Cys 

595 600 605 

CGT GAG ACA CGC CGC AAC CAA TGC TTT TCA CAA TAC TCA A7^ CGA GTG 193 0 
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# 



Arg Glu Thr Arg Arg Asn Gin Cys Phe Ser Gin Tyr Ser Lys Arg Val 

610 615 620 625 

ACA TCC AAA ACT AAC CCA AAA GAC ATA GAA TGG CAC AGT GGA GAA AAC 197 8 

Thr Ser Lys Thr Asn Pro Lys Asp lie Glu Trp His Ser Gly Glu Asn 

630 635 640 

CCT ATC TCT ATT GAA GGC GTT AAA ACC TTT GCG CTT TAG CTC TAT CAA 2026 
Pro lie Ser lie Glu Gly Val Lys Thr Phe Ala Leu Tyr Leu Tyr Gin 

645 650 655 

GCC AAA AAA CTT ATC CTC TCC AAG CCC TCT CAA GAT CTT GAC ATA GCT 2 074 

Ala Lys Lys Leu lie Leu Ser Lys Pro Ser Gin Asp Leu Asp lie Ala 

660 665 670 

CTT GAC CCA TTC GAA TTC GAG CTC ATC ACT GTT TCA CCA GTG ACC AAA 2122 
Leu Asp Pro Phe Glu Phe Glu Leu lie Thr Val Ser Pro Val Thr Lys 

675 680 685 

CTC ATC CAA ACT TCT CTA CAC TTT GCC CCA ATT GGG CTG GTG AAC ATG 2170 
Leu lie Gin Thr Ser Leu His Phe Ala Pro lie Gly Leu Val Asn Met 
690 _ 695 700 705 

CTT ^C ACT AGT GGA GCC ATC CAA TCT GTG GAC TAT GAC GAT GAC CTA 2218 
Leu msn Thr Ser Gly Ala lie Gin Ser Val Asp Tyr Asp Asp Asp Leu 

fj 710 715 720 

AGCTSCA GTC GAG ATT GGT GTC AAA GGG TGT GGT GAG ATG CGA GTA TTT 2266 
Server Val Glu lie Gly Val Lys Gly Cys Gly Glu Met Arg Val Phe 

Q 725 730 735 

GCAm^CG AAA AAA CCA AGG GCT TGT CGT ATT GAT GGG GAG GAT GTT GGG 2314 
Ala £ Ser Lys Lys Pro Arg Ala Cys Arg lie Asp Gly Glu Asp Val Gly 

□ 740 745 750 

TTC^5^G TAT GAT CAG GAC CAA ATG GTG GTG GTT CAA GTG CCA TGG CCA 23 62 

PheRLys Tyr Asp Gin Asp Gin Met Val Val Val Gin Val Pro Trp Pro 

£755 760 765 

ATTMSAT TCT TCA TCG GGT GGC ATT TCG GTT ATC GAG TAC TTG TTT 24 07 

Ile^sp Ser Ser Ser Gly Gly lie Ser Val lie Glu Tyr Leu Phe 
770 775 780 

TAATTTTTAT TTATGTAAGC TCAATGATTG TTGTTGTTGT CGCTGTTGTT GCTATCAATG 24 67 

TATTTCTCTC CAAAAGAAAA TTATGTGTAA TTTGGAGAGT AATTAAGTGA 2517 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
Met Ala Pro Ser Phe Lys Asn Gly Gly Ser Asn Val Val Ser Phe Asp 

15 10 15 

Gly Leu Asn Asp Met Ser Ser Pro Phe Ala lie Asp Gly Ser Asp Phe 

20 25 30 

Thr Val Asn Gly His Ser Phe Leu Ser Asp Val Pro Glu Asn lie Val 

35 40 45 

Ala Ser Pro Ser Pro Tyr Thr Ser lie Asp Lys Ser Pro Val Ser Val 
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50 



Gly Cys 


Phe 


Val 


65 






Val Ser 


He 


Gly 


Phe Lys 


Val 


Tirp 






100 


Leu Glu 


Ser 


Glu 




115 




Arg Pro 


Tyr 


Val 


130 






Ser lie 


Gin 


Pro 


145 






Gly Ser 


Ser 


Lys 


His Ala 


Gly Asp 






1 ft n 

X o w 


Val J^rg 


X li.x 


nx o 




195 




ProSsiv 


J. X c 


V d X 


^210 






Leuf Ahr 


VAX 


nx ID 


225Si 






AspQjly 


Gly 


Cys 


Sers He 


Glv 


His 






260 


Thr-^al 


CI 


w X 2 




275 




Asn'^Tyr 
— J 


Lys 


Phe 








aHjlv 


Gin 


Lys 


305 






Phe Lys 


Thr 


Val 


Trp Gly 


Gly 


Leu 






340 


He Gin 


Pro 


Val 




355 




Ala Val 


Asp 


Lys 


370 






Lys Ala 


Glu 


Glu 


385 






Gly He 


Asp 


Gly 


Cys Glu 


Asp 


Tyr 






420 


Ala Met 


Thr 


Lys 




435 





55 

Gly Phe Asp Ala 
70 

Lys Leu Lys Asp 
85 

Trp Thr Thr His 

Thr Gin He Val 
120 

Phe Leu Leu Pro 
135 

Gly Asp Asp Asp 
150 

Val Val Asp Ala 
165 

Asp Pro Phe Ala 

Leu Gly Thr Phe 
200 

Asp Lys Phe Gly 
215 

Pro Gin Gly Val 
230 

Pro Pro Gly Leu 
245 

Asp Ser Asp Pro 

Glu Gin Met Pro 
280 

Arg Asp Tyr Val 
295 

Gly Met Lys Ala 
310 

Glu His Val Tyr 
325 

Arg Pro Gin Val 

Leu Ser Pro Gly 
360 

He Val Leu His 
375 

Met Tyr Glu Gly 
390 

Val Lys He Asp 
405 

Gly Gly Arg Val 

Ser He Asn Lys 
440 



60 

Ser Glu Pro Asp 
75 

He Arg Phe Met 
90 

Trp Val Gly Arg 
105 

He Leu Glu Lys 

He Val Glu Gly 
140 

Phe Val Asp Val 
155 

Ser Phe Arg Ser 
170 

Leu Val Lys Glu 
185 

Arg Leu Leu Glu 

Trp Cys Thr Trp 
220 

He Glu Gly Val 
235 

Val Leu He Asp 
250 

He Thr Lys Glu 
265 

Cys Arg Leu Leu 

Asn Pro Lys Ala 
300 

Phe He Asp Glu 
315 

Val Trp His Ala 
330 

Pro Gly Leu Pro 
345 

Leu Gin Met Thr 

Lys Val Gly Leu 
380 

Leu His Ala His 
395 

Val He His Leu 
410 

Asp Leu Ala Lys 
425 

His Phe Lys Gly 



Ser Arg His Val 
80 

Ser He Phe Arg 
95 

Asn Gly Gly Asp 
110 

Ser Asp Ser Gly 
125 

Pro Phe Arg Thr 

Cys Val Glu Ser 
160 

Met Leu Tyr Leu 
175 

Ala Met Lys He 
190 

Glu Lys Thr Pro 
205 

Asp Ala Phe Tyr 

Arg His Leu Val 
240 

Asp Gly Trp Gin 
255 

Gly Met Asn Gin 
270 

Lys Phe Gin Glu 
285 

Thr Gly Pro Arg 

Leu Lys Gly Glu 
320 

Leu Cys Gly Tyr 
335 

Glu Ala Arg Val 
350 

Met Glu Asp Leu 
365 

Val Pro Pro Glu 

Leu Glu Lys Val 
400 

Leu Glu Met Leu 
415 

Ala Tyr Tyr Lys 
430 

Asn Gly Val He 
445 
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Ala Ser Met Glu 
450 

lie Ser Leu Gly 
465 

Gly Asp Pro Asn 

Cys Ala Asn Asp 
500 

Asp Met Phe Gin 
515 

Arg Ala lie Ser 
530 

His Asn Phe Asp 
545 

Leu Arg Ser Glu 

Asp^ro Leu His 

~ 580 

Lys^he Thr Gly 
%J 595 

Cysfjlrg Glu Thr 

Seio 

ValCThr Ser Lys 
625K 

Asns Pro lie Ser 

Gln^la Lys Lys 
^ 660 

Ala^eu Asp Pro 
S ^75 

LysHLeu lie Gin 
690 

Met Leu Asn Thr 
705 

Leu Ser Ser Val 

Phe Ala Ser Lys 
740 

Gly Phe Lys Tyr 
755 

Pro lie Asp Ser 
770 



# 

His Cys Asn Asp 
455 

Arg Val Gly Asp 
470 

Gly Thr Phe Trp 
485 

Ser Leu Trp Met 

Ser Thr His Pro 
520 

Gly Gly Pro lie 
535 

Leu Leu Lys Lys 
550 

Tyr Tyr Ala Leu 
565 

Asn Gly Glu Thr 

Val lie Gly Ala 
600 

Arg Arg Asn Gin 
615 

Thr Asn Pro Lys 
630 

lie Glu Gly Val 
645 

Leu lie Leu Ser 

Phe Glu Phe Glu 
680 

Thr Ser Leu His 
695 

Ser Gly Ala lie 
710 

Glu lie Gly Val 
725 

Lys Pro Arg Ala 

Asp Gin Asp Gin 
760 

Ser Ser Gly Gly 
775 



Phe Met Phe Leu 
460 

Asp Phe Trp Cys 
475 

Leu Gin Gly Cys 
490 

Gly Asn Phe lie 
505 

Cys Ala Ala Phe 

Tyr Val Ser Asp 
540 

Leu Val Leu Pro 
555 

Pro Thr Arg Asp 
570 

Met Leu Lys lie 
585 

Phe Asn Cys Gin 

Cys Phe Ser Gin 
620 

Asp lie Glu Trp 
635 

Lys Thr Phe Ala 
650 

Lys Pro Ser Gin 
665 

Leu lie Thr Val 

Phe Ala Pro lie 
700 

Gin Ser Val Asp 
715 

Lys Gly Cys Gly 
730 

Cys Arg lie Asp 
745 

Met Val Val Val 

lie Ser Val lie 
780 



Gly Thr Glu Ala 

Thr Asp Pro Ser 
480 

His Met Val His 
495 

His Pro Asp Trp 
510 

His Ala Ala Ser 
525 

Ser Val Gly Lys 

Asp Gly Ser lie 
560 

Cys Leu Phe Glu 
575 

Trp Asn Leu Asn 
590 

Gly Gly Gly Trp 
605 

Tyr Ser Lys Arg 

His Ser Gly Glu 
640 

Leu Tyr Leu Tyr 
655 

Asp Leu Asp lie 
670 

Ser Pro Val Thr 
685 

Gly Leu Val Asn 

Tyr Asp Asp Asp 
720 

Glu Met Arg Val 
735 

Gly Glu Asp Val 
750 

Gin Val Pro Trp 
765 

Glu Tyr Leu Phe 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc= "Synthetic DNA" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
TTYTAYCTBA CHGTNCAYCC TCA 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc= "Synthetic DNA" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
TTYTAYCTBA CHGTNCAYCC CCA 



(2) INFORMATION FOR SEQ ID NO : 8 : 

g(i) SEQUENCE CHARACTERISTICS: 

3 (A) LENGTH: 23 base pairs 

m (B) TYPE: nucleic acid 

SI (C) STRANDEDNESS: single 

Ui (D) TOPOLOGY: linear 

TMii) MOLECULE TYPE: other nucleic acid 

D (A) DESCRIPTION: /desc= "Synthetic DNA" 

S(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

TTYTAYCTBA CHGTNCAYCC ACA 

s — = 

(2) Information for seq id no:9: 
]^ (i) sequence characteristics: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc= "Synthetic DNA" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
TTYTAYCTBA CHGTNCAYCC GCA 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc= "Synthetic DNA" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



(D) OTHER INFORMATION: N at 6 and 11 = inosine 

Other N = A, G, C, or T 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GARGGNGTNM GNCAYCTRGT NGAYGG 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc= "Synthetic DNA" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(D) OTHER INFORMATION: N at 6 and 11 = inosine 
□ Other N = A, G, C, or T 

^j[xi) sequence description: seq id no: 11: 
garc^ngtnm gncayctygt ngaygg 

(2) ^Information for seq id no:12: 

2 (i) sequence CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 
^ (B) TYPE: nucleic acid 

L (C) STRANDEDNESS: single 

2 (D) TOPOLOGY: linear 

nj(ii) MOLECULE TYPE: other nucleic acid 

jfl (A) DESCRIPTION: /desc= "Synthetic DNA" 

Q(ix) FEATURE: 

Q (A) NAME/KEY: 

(B) LOCATION: 

(D) OTHER INFORMATION: N at 6 and 11 = inosine 

Other N = A, G, C, or T 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12: 
GARGGNGTNM GNCAYTTRGT NGAYGG 



(2) INFORMATION FOR SEQ ID NO:13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc= "Synthetic DNA" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(D) OTHER INFORMATION: N at 3 = inosine 



other N = A, G, C, or T 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTNGGNTGYT TYGTNGGYTT YGAYGC 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc= "Synthetic DNA" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(D) OTHER INFORMATION: N at 3 = inosine 

Other N = A, G, C, or T 
d^xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GTN(^NTGYT TYGTNGGRTT YGAYGC 

(2) "^-INFORMATION FOR SEQ ID NO: 15: 

U1 (i) SEQUENCE CHARACTERISTICS: 

^ (A) LENGTH: 2 9 base pairs 

y (B) TYPE: nucleic acid 

^ (C) STRANDEDNESS: single 

L (D) TOPOLOGY: linear 

"^(ii) MOLECULE TYPE: other nucleic acid 

Si (A) DESCRIPTION: /desc= "Synthetic DNA" 

!S(ix) FEATURE: 

5 (A) NAME/KEY: 

□ (B) LOCATION: 

(D) OTHER INFORMATION: N at 9 and 11 = inosine 

Other N = A, G, C, or T 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TTYGAYGCNT CNGARCCHGA YTCDCGNCA 

(2) INFORMATION FOR SEQ ID N0:16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc= "Synthetic DNA" 
( ix) FEATURE : 

(A) NAME/KEY: 

(B) LOCATION: 

(D) OTHER INFORMATION: N at 9 and 11 = inosine 

Other N = A, G, C, or T 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TTYGAYGCNT CNGARCCHGA YTCDAGYCAY 3 0 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc= "Synthetic DNA" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GAYCARGAYC TRATGGTNGT 2 0 



(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 
_ (A) LENGTH: 2 6 base pairs 

^ (B) TYPE: nucleic acid 

^ (C) STRANDEDNESS: single 

^1 (D) TOPOLOGY: linear 

i^ii) MOLECULE TYPE: other nucleic acid 
^ (A) DESCRIPTION: /desc= "Synthetic DNA" 

Q[ix) FEATURE: 
m (A) NAME/KEY: 

(B) LOCATION: 

□ (D) OTHER INFORMATION: N at 6 and 15 = inosine 

Other N = A, G, C, or T 
ry(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
CCR^NACYA GRTGNCKNAC NCCYTC 26 

(2)OlNFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc= "Synthetic DNA" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(D) OTHER INFORMATION: N at 6 and 15 = inosine 

Other N = A, G, C, or T 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCRTCNACRA GRTGNCKNAC NCCYTC 26 

(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc= "Synthetic DNA" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

/ (D) OTHER INFORMATION: N at 6 and 15 = inosine 

Other N = A, G, C, or T 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CCRTCNACYA TRTGNCKNAC NCCYTC 

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

□ (C) STRANDEDNESS: single 

C= (D) TOPOLOGY: linear 

.^'^ii) MOLECULE TYPE: other nucleic acid 

'?i (A) DESCRIPTION: /desc= "Synthetic DNA" 

y^ix) FEATURE: 

% (A) NAME/KEY: 

^ (B) LOCATION: 

(D) OTHER INFORMATION: N at 3 and 18 = inosine 
Q Other N = A, G, C, or T 

^Bxi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TGNpGHGART CDGGYTCNGA NGCRTCRAA 

(2) INFORMATION FOR SEQ ID NO: 22: 
D (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc= "Synthetic DNA" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(D) OTHER INFORMATION: N at 19 = inosine 

Other N = A, G, C, or T 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
RTGRCTHGAR TCDGGYTCNG ANGCRTCRAA 
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